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To -provide a mathematical tool for the evaluation of cable pairs, this 
paper suggests a quality measure which is based on information theory. 
While a group of cable pairs (paired wires) with a given gauge and con- 
struction are nominally equivalent, manufacturing tolerances and differences 
in installation and environment lead to variation from the nominal param- 
eters. 

To measure the quality of a group of channels {for example, cable pairs 
leaving a central office), this paper recommends the following procedure. 
Choose a fraction p of the original group and evaluate the mutual informa- 
tion between input and output of each channel in the subgroup, subject 
to the in-put to each channel being chosen from the same process. Then, by 
choosing the proper process, maximize the smallest mutual information in 
the subgroup. This largest possible minimum mutual information is a 
quality measure for the subgroup. Next, apply this measure to all subgroups 
of fractional size p; the subgroup with the highest measure provides the 
numerical value of the quality measure for the original group relative to 
fraction p. Repeat this procedure for all p (0 ^ p ^ 1). The resulting 
function is the suggested quality measure of the group. 

To illustrate the above measure of quality, we derive the capacity of an 
ensemble of channels with stationary Gaussian inputs, additive noise, 
and crosstalk. In the Appendix we derive the capacity of a single such 
channel. 

I. INTRODUCTION 

Cables are usually analyzed as if all of the components had a par- 
ticular set of parameters (for example, nominal, worst case, and so on). 
Because of manufacturing tolerances, installation differences and various 
environmental effects, however, transmission parameters actually vary 
from pair to pair. To account for these variations, this paper takes an 
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approach based on information theory. We consider the cable network 
to be a statistical population of channels which have parameters that 
vary from channel to channel. We propose a quality measure for the 
network based on this model. 

After denning channel capacity, we present the suggested capacity 
definition for a group of channels (as outlined in the Abstract). Using 
this definition, we provide an example in Section IV in which the 
capacity of a group of channels with stationary Gaussian inputs, additive 
noise and crosstalk is derived. The capacity for a single such channel is 
found in the Appendix. In Section 4.1 the crosstalk is assumed to differ 
from channel to channel; in Section 4.2 the channel attenuation is 
assumed variable; and in Section 4.3 both the crosstalk and attenuation 
vary from channel to channel. 

These results indicate the trade-off between design rate for a trans- 
mission system and the expected fraction of channels which will be 
capable of error-free transmission at the design rate. 

This technique can also be used to evaluate different parameter dis- 
tributions as may result from tighter production controls. 

II. CHANNEL CAPACITY 

A channel is defined as a probabilistic mapping of one stochastic 
process onto another (for our problem we consider the processes to be 

time functions). (See Fig. 1.) 

Let 

s(t) be the input stochastic process, 
r(t) be the output stochastic process, 
s T = {8(0 :*e[- 7/2, T/2]}, 
s be s„ . 

Then the operation of the channel can be written in terms of a prob- 
abilistic mapping F as 

F{s} = r. (1) 

The capacity is defined as the maximum (over input processes) mutual 
information between input and output, that is, 



s(t)- 



CHANNEL 



•r(t) 



Fig. 1 — Channel model. 
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C = sup lim sup „ I(r T , s r ) = sup (I(r, s)) (2) 

a T-x> J- » 



where 



(I(r, s)> s lira sup ~ 7(r r , s r ) . (3) 

/(?> , s T ) is the mutual information* between s T and ra- 
the supremum is taken over all possible distributions of input signals 
subject to some constraint (for example, fixed power), 
and for a large class of channels including memoryless channels and 
colored Gaussian channels C is the maximum information rate (that is, 
the maximum error free transmittable rate). 

The maximization of equation (2) will yield not only C, but more im- 
portantly perhaps, the properties of s(t) which will achieve C. 

III. CAPACITY DEFINITIONS FOR A GROUP OF CHANNELS 

Now, consider the extension of the capacity definition to a group of 
channels. Capacity is dependent not only on the nature of the channel, 
but also the nature of the constraints placed on the input. For different 
sets of input constraints, different capacities will be obtained. This 
section contains two possible alternate capacity definitions [equations 
(5) and (6)] followed by the recommended definition [equation (10)]. 

A natural extension of equation (2) to a class of channels (formally 
the set {co : w e ft}) 

F M { S ) = r lu) , coeft 

(that is, F (u) is the mapping corresponding to channel w) would be to 
define the capacity as the sum of the individual capacities, or the aver- 
age capacity for an infinite set ft. That is, the capacity of each channel is: 

C M =sup </(r (w \ s (u) )), 

( (u) 

where the supremum is performed for each channel separately, con- 
strained as before. Then one measure of the capacity of the ensemble 
could be the total capacity (for a finite set) : 

c T = zr. (4) 

tit Q 

Another measure could be the average per-channel capacity 



* See for example Gallager. 1 
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(C)r = E{C la) ), (5) 

where E denotes the expectation. 

As for the single channel, equation (4) [or (5)] yields both C T (or (C) T ) 
and the properties of the set S = \s ia) : co e ft) which will achieve C T 
(or (C%). This number defines the maximum transmittable rate when 
the input processes are chosen for each channel individually. In many 
instances this may not be a practical measure in that it may be desirable 
to use a single signaling set on all members of ft. A measure using a 
single signaling set has been suggested in the literature:" 

C B = supinf</(r (c ",s)). (6) 

a aid 

The desirable property of C B is that it will result in a signal distribu- 
tion which, when applied to any member, weft, will permit transmission 
at rates arbitrarily close to C B with arbitrarily small probability of error. 
That is, C B is the maximum rate which will work on all members of the 
group when one process is sent over all channels. However, this seems 
to be an overly pessimistic measure in that if ft should have even one 
member with poor transmission properties, C B will reflect this single poor 
member in exactly the same way as if all of ft were equally bad. 

To overcome this difficulty a new capacity definition is introduced. 
This definition is actually a fimction rather than a single number for the 
group of channels. This definition is essentially C B , restricted to the 
best subset, of size p, of the original group of channels, as a function 
of p. To formalize this notion: 

Let ft x (p) be a subset of ft (indexed by X) of fractional size p. That is, 
for weft 

Pr {a>:co E fi x (p)} = P- (7) 

Let ft(p) be the set of all such subsets: 

Q(p) = {fi x (p)|. (8) 

Find C B for each subset ftx(p): 

C B [Qx(p)\ = sup inf (I(r (u) ,s)). (9) 

• wtQ\(p) 

Finally consider the supremum over all such subsets, that is 
C(p) = sup sup inf (I(r (a) ,s)) 

X « utfixtp) 

= SUp Cj,n x . 0°) 

X 
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Note that equation (10) can also be written as: 

•\s)>. 



C(p) = sup sup 

i x 



inf (I(r { 

tS)x(p) 



The following coding theorem can be proved almost by inspection: 
C(p) is the supremum of rates which can be transmitted error free over 
at least the fraction p of the original ensemble of channels when the 
inputs to all channels are from the same signal distribution. (Clearly, 
for larger p, more channels are required to be capable of error free 
transmission at rates arbitrarily close to C(p) than for smaller p. There- 
fore, C(p) decreases as p increases, more of the set of poor channels 
included.) 

C(p) would then be plotted as in Fig. 2 which intentionally represents 
an ensemble for which most of the channels have near nominal param- 
eters, a small percentage have worse parameters, and a small percentage 
better parameters. From the figure, if all channels must have error-free 
transmission, then the design rate for any system can be no greater 
than C B • However, if design criterion only requires p x of the channels 
to be error free then rate C { can be used. Similarly, if only p 2 of the 
channels need operate without errors, rate C 2 can be used. 

C(p) is a useful measure for the following reasons: 

(i) If it is desired that a given fraction of the channels have satis- 
factory transmission, then C(p) indicates the maximum permissible 
rate. 

(it) If the objective is to provide a given transmission rate C(p), 
then the value of p indicates what percentage of the channels will be 
capable of operating without errors. 



C(p) 




P2 Pi ' 

ERROR- FREE FRACTION 



Fig. 2 — C(p) vs. p. 
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(in) When deciding between two alternative groups of channels, 
C(p) can indicate those transmission rates for which one type is better 
than the other. 

IV. EXAMPLE 

To provide an example for the use of the C(p) function, consider an 
ensemble of channels, each of which can be modeled as shown in Fig. 3.* 

where 

H(u>) is the channel transfer function, 

X(u) is the crosstalk transfer function, 
s(t) is the signal with one-sided power spectral density S s (o)), 
s(t) is the signal on an adjacent channel with the same power spec- 
tral density, and 
n(t) is the noise with power spectral density S N (oi). 

If only stationary Gaussian inputs are considered the average mutual 
information between the input and output of any channel in the en- 
semble is: 3,4 

where 

S s (u>) is the input signal power spectral density, 

SM = S s (o>) | H(u) I s (1 + I X(«) | 2 ) + £*(«)« (12) 

is the output power spectral density, and 

SsM = S a (u)B*(u), (13) 

is the input-output cross power spectral density. 
Then, 

</(r, s)) = -I 

r . [\ sib) i g( M ) | 2 1 . 

Jo g L S 2 s (o>) I H(u) I 2 (1 + I Z(«) | 2 ) + Ss(<*)S„(o>)i au ' 

(14) 



* a> is used here to represent frequencj' and not probability spaces as in the 
first part of this paper. 
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Fig. 3 — Channel model. 



Define 



Then 



S,v(w) = 



(I(r, s)) = ^ 



£/>[ 1 + 



H(fo) I 



S 

X(w) | 2 s t 



(15) 



S («J + S. v (w)J 



4.1 Fi&ed A^o?'se arccZ Channel, Distributed Crosstalk 

Assume that the crosstalk parameters of all of the cables are the 
same except for a multiplier, that is: 



Z|(w) | = €{ | X(<a) |. 



(17) 



Equation (16) indicates that for any choice of S s (a), (I(r, s)) de- 
creases as e £ increases. Thus, while equation (9) requires a minimization 
followed by a maximization, it is clear in the case that the "inf" for a 
given p occurs for the largest e £ in the subset £2 x (p). Further, the "sup" is 
achieved using the spectrum that achieves capacity on the channel 
with the largest e { . Finally, the "sup" in equation (10) is achieved by 
choosing the £2 x (p) such that ^ e ^ e p where e p is such that 



V = \ P.i(e) de, 



(IS) 



where p« t (e) is the probability density of e £ . Thus using the capacity 
result obtained in the Appendix 

5 («) = [S. v (aw) - S.v(co)][l - H\X(u) \\el\X(o>) | 2 + 1) + •••]• 

(19) 
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The capacity function can now be found: 

+ * log {l + 44 | X(«) | [el | X(«) | 2 + 1] ^^}] &»}' 

(20) 
Consider the following example to illustrate the above: 
Let S N (u>) be zero. Then from equation (16) the mutual information 
is independent of the signal spectrum and 



c w-s^ ,o «[ 1 + ^lWl , ] d •• 



(21) 



(This is achieved for any spectrum with finite power which is strictly 
greater than zero for all frequencies.) Let 

| Z(co) | 2 = co 2 (22) 

and let log e„ be normally distributed with mean —8.2 and <x = 0.17. 
(The crosstalk figures are idealizations of typical figures for 22 gauge 
PIC). Then 



where 



and 



^ rf ro.S M M°" 



*«- S3* £•""■*• (24) 



<W-S¥a (inbite) 

= ^- (innats). (25) 

These results are indicated in Fig. 4. For p = 0.8, C(p) is 6.3 X 10 7 . 
For p = 0.1, C(p) is 1.3 X 10 8 . Thus a factor of two is reahzed in ca- 
pacity by reducing the required usable fraction of channels by a factor 
of eight. 

4.2 Fixed Crosstalk and Noise, Distributed Channel 

Assume 

SM = e N , (26) 
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2.0X10 8 




0.2 0.4 0.6 0.8 

ERROR-FREE FRACTION 



Fig. 4 — Capacity vs. fraction of satisfactory channels. 



(27) 



and that «{(co) is such that 

a fl (oj.) > aj.(o),) => «{,(&>,-) ^ Qf fl (aj/). (28) 

That is, if the attenuation of channel £i is greater than the attenuation 
of £ 2 at one frequency, it is greater at all frequencies. Then, again design 
for the worst case, that is, 



V = \ P ai (a) da. 
Jo 



(29) 



(Again the smaller a, the "better" the channel.) 

_2o,(«o.»i) -2<Jp(u>) 



6 (co) - e*[e a "' ( -" J - e 2 - (u, ][l - 1 I X( a ) | 2 (| X(«) | 2 + 1)], (30) 



and 



^=^{/r{ iog [ i+ w?] 

+ h log [l + 4 | X(«) | 2 (| X(«) | 2 + 1) ^"("T]} rfw }' (31) 

4.3 Fixed Noise, Distributed Channel and Crosstalk 

Proceeding as in the previous two sections, choose the signal spec- 
trum to achieve capacity on a channel with a particular e„. and a*. , 
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that is: 

flo.(«) = €iv[exp [2a Pl (ow)] - exp [2a Pl (f>)]] 

•[i-£4l*(")l 2 (4IWl 2 + i)]. (32) 

+ \ log [l + 4e p . | Z(«) | 2 (4 | Z(«) | 2 + 1) 



exp [2a Pl (6w)] 
exp [2a P »] J 



(33) 



Here, p is found by integrating the joint density of a^ and e E over the 
region where the mutual information is greater than C z (p). That is, 

V = jf p. ( , a( (e,a)deda (34) 

where R is the region in the (e, a) space, where 

J- f ' " log Tl + , w x ,» , 1 5-7-xl dw = C '(P>- ^ 

2rJ b L «« I ^(w) I + e ^ ex P 2a t (w) J 

The difficulty with this problem at this point is that there is no 
unique e Pl and a,,, which yield a given p in equation (34). Thus it is 
necessary to search all such sets (e Pl , a,,) which yield the same p, and 
choose the one which yields the largest C x (p). While this may seem 
difficult, it can be implemented with a computer search. 

V. CONCLUDING REMARKS 

The quality measure suggested herein should be considered part of 
the development of a technique for comparison of groups of cables. 

It should be pointed out that, while the definition of capacity for a 
single channel is straightforward, the definition of capacity for an 
ensemble of channels depends on certain assumptions concerning the 
use of the channels. 

(C) T , the average per channel capacity, [equation (5)1 assumes each 
channel will be used individually, and optimized individually. C B , 
BlackwelTs definition, [equation (6)] assumes the same input distribu- 
tions will be used for all channels in the group and that the channels are 
used individually. Further, no more than the minimum rate transmit- 
table over the worst channel will be sent over each channel. 
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The recommended definition, C(p), equation (12), is similar to C B 
except that the fraction 1 — p of the worst channels will not work 
satisfactorily and could be discarded. 

The example carried out in Section IV uses a new result on the ca- 
pacity of a single channel with crosstalk and additive Gaussian noise 
which is derived in the Appendix. The discussion on the calculus of 
variations, contained before equation (40), can be used to rigorously 
prove some old results for optimum spectra in the presence of additive 
Gaussian noise [see footnote, following equation (52)]. 

In applying C(p), it is noted that when only one parameter is un- 
known (for example, the magnitude of the crosstalk), the capacity of 
the group of channels as a function of p (the fraction of usable channels) 
is a simple calculation. However, when more than one parameter is 
unknown (for example, the magnitude of the crosstalk as well as the 
channel attenuation), search techniques are indicated. 

APPENDIX 

Capacity Calculation for a Single Channel 

Consider the channel model of Fig. 3. The mutual information ex- 
pression is given in equation (16) and is repeated below: 



(I(r, s)> = ;f f M log 1 + I yy K i2 ^ T o y W 
N v " 2w Jq _ A(w) 6 s (w; + Sa-(w)J 



(36) 



Now capacity for this channel is defined as 

C = sup </(r,s)>, (37) 



■S s (u>> 

subject to 



1 f w 

— — / S s (co) du ^ P = power, 

27T J 



and 

S s (co) ^ for all w. (38) 

The method of variational calculus can be applied to determine the 
optimum solution. 
Let 

S a (o>) be the assumed optimum solution, and 
8- e(w) be a small perturbation, then 
£s(co) = S (fi>) + M«). 
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In order to account for S s (o)) ^ 0, 5-e(to) must be nonnegative when- 
ever S (o)) = 0. That is, in order that $ s (co) be a power spectral density, 
whenever S (u>) = 0, the perturbation at that frequency must be such 
that the resulting density at that frequency remain nonnegative. Note 
that since *S (o>) is to be the optimum solution, (I(r, s)) must be a 
maximum at 8 = for all permissible 8-e(u>). This implies that 
d(I)/db | « = o will be negative for all permissible S-e(co) which approach 
zero from the positive side (S-e(co) — > + ), and d(I)/d8 \ s=0 will be 
positive for 5-e(co) — > 0~. Or finally, d(I)/d8 [ 8 _ = whenever 8-e(u>) 
can approach zero from either the positive or negative side and the 
derivative is defined. Now, 



2tt 



88 



d8 



1 + 



1 



X(«) | 2 + 



&y(a)) 



+ 



88 



/»+oo 

Jo bg 

|AWI "*" ,S'„(eo) + S-efa) 

~M {/ [Sofa) + «■€(&))] (fco - 2ttP 



aco 



(39) 



where n is a Lagrangian multiplier used to introduce the power con- 
straint. 
Or 



as 

where 



,-o " I €(w) LA(co)^(co) + *£)&(«) + $%(<*) - M J *"» (40) 



A(«) = | Z(«) | 2 (| X(«) | 2 + 1) , 
B(u) = [2 |X(w) | 2 + l]8jy(ca). 

In equation (20) one now considers all permissible e(a>). Whenever 
*S (w) is nonzero (that is, 5-e(w) is unrestricted), the integrand is zero, 
since for these frequencies, e(w) is arbitrary. However, when S (a)) 
is zero (5- e(o) ^ 0) all that can be said is that the integrand is negative. 
That is, as one approaches the boundary of permissible 5-e(oj), (I(r, s)) 
must be monotonically increasing. 

This yields: 



§,v(co) 



= M 



A(u)Sl(o>) + B(a>)S (a>) + §Uo>) 

for all to such that 5 (oj) 5* 0, (41) 

and 
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-|r - n ^ for all « such that £ (co) = 0. (42) 

§Ar(co) 

(This equation is simply the integrand for S (u>) = 0.) 
Or from the above for *S (co) ^ 0, 

Q .. -f?(co) ± |fl 2 (co) - 44(co)C(co)|* 
S (co) - 2A(w) 

where 

C(«) = 8^(«) - &r(«)/fi. (43) 

Now, & (co) must be nonnegative. In order that equation (43) yield a 
nonnegative result, the positive root must be taken, and further the 
following inequality must be satisfied: 

{£ 2 (co) - 4i4(«)C(co)} } ^ B(«). (44) 

Relation (44) imphes 

C(w) g for all co such that 5 (w) > 0. (45) 

Relation (42) (which imphes 1/ju ^ 0) can be rewritten as 

C(co) ^ for all co such that S (w) = 0. (46) 

If we assume that Sjv(co) is a monotonically increasing function* of co, 
relations (45) and (46) imply 

- = S iV (aw), (47) 

M 

and 

£ (co) = co ^ co mai . (48) 

Hence f rewriting equation (43) 

- , s , M il + 4A(co)S„(co max )/S„(co))> ~ 11 + 4,4(co)}* , 

<S (co) = Sjv(co) 2il(w) 

where co mux can be found from 

±- /"""" 5 (co) rfco = P. (50) 

2tt J 

This can be used to obtain a relation between co raaI and P. 



* This is not at all necessary. It just simplifies the form of the following equations. 
+ In what follows, it is understood that the expression given for S (w) holds for 
w ^ coma,, and that 5 (w) = otherwise. 
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Now if the crosstalk is small [|X(co) | <$C 1], each of the radicals in the 
equation for Sofa) can be approximated by the first few terms in the 
binomial expansion. Then 



Sofa) = [SA'(o) mux ) — B N fa)] 



1 - i | Xfa) | 2 (| Xfa) | 2 + 1) 



T!ii( W )F(H( U )r + i)(^-i) + -], 



(51) 



Sofa) S [S„(aw) - S*(«)][l - 1 | X(«) | 2 (| X(«) | 2 + 1)], 

^ [M&w) - **(»)]. (52) 

This result indicates that for small crosstalk, the signal spectrum ought 
to be designed independent of the crosstalk spectrum. Equation (52) 
is the familiar "spectrum filling" result for additive Gaussian noise.* 
This is shown in Fig. 5. The exact solution [equation (51)] (superimposed 







EXACT 
SOLUTION 


s 




S (w)+ S N (w) 




i-— - 


•A 

Sm(w) 




"" 



















Fig. 5 — Typical optimum spectrum 1. 

with the broken lines) simply implies some shaping. Figure 6 illustrates 
the solution when S N (a>) is not monotonically increasing. 
With the solution obtained for S fa) 

Capacity = £ {( ' log [l + Jj^-jiJ d« 

+ | /"" log {l + 4 | Xfa) | 2 [| Xfa) | 2 + 1] ^^} &,}• (53) 
Using the small crosstalk approximation for £ (co): 



* This result is contained in Fano 5 , pp. 173ff. That proof is not as rigorous as 
the one presented here as Fano does not prove that the optimum spectrum is 
zero whenever the noise is greater than a threshold. Fano's result can be proved 
directly by noting the discussion preceding equation (21) herein. 
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Fig. 6 — Typical optimum spectrum 2. 



Capacity 



Ztt J i, 



1 + 



S iV (ow) ~ S A .(w) 



Z 2 (o>) I [S A -(ow) - S jV (co)] + S 



w(")J 



</w, 



1 + 



S.v(t»m.») — Sjy(&>) 1 ; 

A» | 2 S iV (w m „) + S jV (co)J "' 






(54) 



(Note that even for small crosstalk, capacity is still a function of the 
crosstalk spectrum.) 
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